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Abstract 

Let D(G) be the smallest quantifier depth of a first order formula which is true 
for a graph G but false for any other non-isomorphic graph. This can be viewed as 
a measure for the first order descriptive complexity of G. 

We will show that almost surely D(G) = ©(^n^), where G is a random tree 
of order n or the giant component of a random graph Q(n,-^) with constant c > 1. 
These results rely on computing the maximum of D{T) for a tree T of order n and 
maximum degree I, so we study this problem as well. 

1 Introduction 

This paper deals with graph properties expressible in first order logic. The vocabulary 

consists of variables, connectives (V, A and ->), quantifiers (3 and V), and two binary 

relations: the equality and the graph adjacency (= and ~ respectively). The variables 
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denote vertices only so we are not allowed to quantify over sets or relations. The notation 
G \= A means that a graph G is a model for a sentence A (a first order formula without 
free variables); in other words, A is true for the graph G. All sentences and graphs are 
assumed to be finite. The Reader is referred to Spencer's book j7j (or to [2]) for more 
details. 

A first order sentence A distinguishes G from H if G \= A but H \£ A. Further, we say 
that A defines G if A distinguishes G from any non-isomorphic graph H. In other words, 
G is the unique (up to an isomorphism) finite model for A. 

The quantifier depth (or simply depth) D(A) is the largest number of nested quantifiers 
in A. This parameter is closely related to the complexity of checking whether G \= A. 

The main parameter we will study is D(G), the smallest quantifier depth of a first order 
formula defining G. It was first systematically studied by Pikhurko, Veith and Verbit- 
sky (3] (see also jB]). In a sense, a defining formula A can be viewed as the canonical 
form for G (except that A is not unique): in order to check whether G = H it suffices 
to check whether H |= A. Unfortunately, this approach does not seem to lead to better 
isomorphism algorithms but this notion, being on the borderline of combinatorics, logic 
and computer science, is interesting on its own and might find unforeseen applications. 

Within a short time-span various results on the values of D(G) for order-n graphs ap- 
peared. The initial papers E] studied the maximum of D{G) (the 'worst' case). The 
'best' case is considered by Pikhurko, Spencer, and Verbitsky 0] while Kim, Pikhurko, 
Spencer and Verbitsky [2] obtained various results for random graphs. 

Here we study these questions for trees and sparse random structures. Namely, the three 
main questions we consider are: 

Section l^t What is D tTee (n,l), the maximum of D(T) over all trees of order at most n 
and maximum degree at most 11 

Section St What is D(G), where G is the giant component of a random graph Q(n, -) 
for constant c > 1? 

Section What is D(T) for a random tree T of order n? 

In all cases we determine the order of magnitude of the studied function. Namely, we 
prove that D trce (n,l) = 6(^f), and whp we have D(G) = whenever G is a 

random tree of order n or the giant component of a random graph Q(n, -) with constant 
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c > 1. (The acronym whp stands for 'with high probability', i.e., with probability 1— o(l).) 
Moreover, for some cases involving trees we estimate the smallest quantifier depth of a 
first order formula defining G up to a factor of 1 + o(l). For instance, we show that for 
a random tree T of order n we have whp D{T) = (1 + o(l)) , 1 ? n . 



2 Further Notation and Terminology 

Our main tool in the study of D(G) is the Ehrenfeucht game. Its description can be 
found in Spencer's book [Zj whose terminology we follow (or see Section 2]), so here 
we will be very brief. 

Given two graphs G and G', the Ehrenfeucht game EHRk(G, G') is a perfect information 
game played by two players, called Spoiler and Duplicator, and consists of k rounds, 
where k is known in advance to both players. For brevity, let us refer to Spoiler as 'him' 
and to Duplicator as 'her'. In the i-th round, i = l,...,k, Spoiler selects one of the 
graphs G and G' and marks one of its vertices by i; Duplicator must put the same label i 
on a vertex in the other graph. At the end of the game let xi, . . . , Xk be the vertices of G 
marked 1, . . . , k respectively, regardless of who put the label there; let x[, . . . ,x' k be the 
corresponding vertices in G' . Duplicator wins if the correspondence X{ <-> x\ is a partial 
isomorphism, that is, we require that {xi,Xj} G E{G) iff {x'^x'j} G E{G') as well as that 
Xi = Xj iff x\ = x'y Otherwise, Spoiler wins. 

The key relation is that D(G,G'), the smallest depth of a first order sentence A distin- 
guishing G from G', is equal to the smallest k such that Spoiler can win EHRfe(G, G'). 
Also, 

D(G) = m^D(G,G'), (1) 

see e.g. Lemma 1]. 

Sometimes it will be notationally more convenient to prove the bounds on D(G,G') for 
colored graphs which generalize the usual (uncolored) graphs. Graphs G, G' are colored 
if we have unary relations Ui : V(G) U V{G') — > {0, 1}, i G /. We say that the vertices 
in the set Uf (1) have color i. Note that some vertices may be uncolored and some may 
have more than one color. There are no restrictions on a color class, i.e., it does not 
have to be an independent set. When the Ehrenfeucht game is played on colored graphs, 
Duplicator must additionally preserve the colors of vertices. 
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Colorings can be useful even if we prove results for uncolored graphs. For example, if 
x G V(G) and x' G V(G') were selected in some round, then, without changing the 
outcome of the remaining game, we can remove x and x' from G and G 1 respectively, 
provided we color their neighbors with a new color. (Note that in an optimal strategy of 
Spoiler, there is no need to select the same vertex twice.) 

We will also use the following fact, which can be easily deduced from the general theory of 
the Ehrenfeucht game. Let x,y G V(G) be distinct vertices. Then the smallest quantifier 
depth of a first order formula $(2) with one free variable z such that G |= but 
G y= &{y) is equal to the minimum k such that Spoiler can win the (k + l)-round game 
EHR,t +1 (G, G), where the vertices x\ = x and x[ = y have been selected in the first 
round. 

In this paper In denotes the natural logarithm, while the logarithm base 2 is written as 
log 2 - 

3 General Trees 

Let D tree (n,l) be the maximum of D(T) over all colored trees of order at most n and 
maximum degree at most I. We split the possible range of /, n into a few cases. 

Theorem 1 Let both I and Inn/ In/ tend to the infinity. Then 



In fact, the lower bound can be achieved by uncolored trees. 

In order to prove Theorem^ we need some preliminary results. Let distc(^, y) denote 
the distance in G between x, y G V(G). 

Lemma 2 Suppose x,y G V(G) at distance k were selected while their counterparts 
x',y' G V{G') are at a strictly larger distance (possibly infinity). Then Spoiler can win 
in at most log 2 k + 1 extra moves, playing all of the time inside G. 

Proof. We prove the claim by induction on k. Assume k > 2 and choose an appropriate 
xy-p&th P. Spoiler selects a vertex w G V(G) which is a middle vertex of P, that is, 




(2) 
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k\ = distp(x,w) and k 2 = distp(y,w) differ at most by one. Suppose that Duplicator 
responds with w' G G'. It is impossible that G' — z' contains both an x'w'-p&th of length 
at most k\ and a yW-path of length at most k%. If, for example, the latter does not 
exist, then we apply induction to y,w G G. The required bound follows by observing 
that ki,k 2 < rfl- I 

The same method gives the following lemma. 

Lemma 3 LetG,G' be colored graphs. Suppose thatx,y G V(G) andx',y' G V(G') have 
been selected such that G contains some xy-path P of length at most k such that some 
vertex of P has color c while this is not true with respect to G' . Then Spoiler can win in 
at most log 2 k + 1 moves playing all of the time inside G. 

The same conclusion holds if all internal vertices of P have colors from some fixed set A 
while any x'y'-path of length at most k has a color not in A. I 

Lemma 4 Let T be a tree of order n and let T' be a graph which is not a tree. Then 
D(T, V) < log 2 n + 3. 

Proof. If T' is not connected, Spoiler selects two vertices x', y' G T' from different 
components. Then he switches to G and applies Lemma El winning in at most log 2 n + 3 
moves in total. 

Otherwise, let C' C T' be a cycle of the shortest length I. If I > 2n + 1, then Spoiler picks 
two vertices x',y' at distance at least n in C' (or equivalently in T"). But the diameter 
of T is at most n — 1, Spoiler switches to T and starts halving the :q/-path, making at 
most log 2 n + 3 moves in total, cf . Lemma El 

If I < 2n+ 1, then Spoiler selects some three adjacent vertices of C', say x', z',y' in this 
order. Now, he applies Lemma El with respect to k = I — 2. I 

Proof of TheoremU\ Let us prove the upper bound first. 

Let T be any tree of order at most n and maximum degree at most /. Let T' be an 
arbitrary colored graph not isomorphic to T. By Lemma we can assume that T' is a 
tree. 

In fact, we will be proving the upper bound on the version of the (T, T')-game, wherein 
some distinguished vertex, called the root, is given and all graph isomorphisms must 
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additionally preserve the root. (This can be achieved by introducing a new color Uq 
which is assigned to the root only.) The obtained upper bound, if increased by 1, applies 
to the original function D(T, T') because we can regard X\ and x[, the first two moves of 
the Ehrenfeucht game, as the given roots. 

It is easy to show that T contains a vertex x G T such that any component of T — x 
has order at most ^. We call such a vertex a median of T. Spoiler selects this vertex x; 
let Duplicator reply with x' . We can assume that the degrees of x and x' are the same: 
otherwise Spoiler can exhibit this discrepancy in at most I + 1 extra moves. 

We view the components of T — x and T' — x' as colored rooted graphs with the neighbors 
of x and x' being the roots. As T ^ T', some component C\ has different multiplicities 
mi and m[ in T — x and T' — x' . As d(x) = d(x'), we have at least two such components. 
Assume that for C\ and 2 we have mi > m[ and m 2 < m' 2 . By the condition on the 
maximum degree, m' 1 + m 2 < I — 1. Hence, min(m'i,m 2 ) < ^p. Let us assume, for 
example, that rn'i < Spoiler chooses the roots of any m[ + 1 Oi-components of T — x. 
It must be the case that some vertices y G V(T) and y' G V(T') have been selected, so 
that y lies in a Ci-component F C T — x while lies in a component F' C T' — x not 
isomorphic to Ci. Let rii be the number of vertices in F. By the choice of x, ri\ < ^. 

Now, Spoiler restricts his moves to V(i r ') U V(i r "). If Duplicator moves outside this set, 
then Spoiler uses Lemma El winning in at most log 2 n + 0(1) moves. Otherwise Spoiler 
uses the recursion applied to F. 

Let f(n,l) denote the largest number of moves (over all trees T,T' with v(T) < n, 
A(T) < I, and T ^ T') that Duplicator can survive against the above strategy with the 
additional restriction that a situation where Lemma El can be applied never occurs and 
we always have that d(x) = d(x f ). Clearly, 



Denoting no = n and unfolding (jlj) as long as rii > 1, say s times, we obtain that f(n, I) 
is bounded by the maximum of 



£> tree (n, I) < fin, I) + log 2 n + I + 0(1). 



(3) 



As mi < we get the following recursive bound on /. 






(5) 
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over all sequences m, . . . , n 8 such that 

l<ni<-^, ie[s]. (6) 

Note that the restrictions (jOJ force s to be at most log 2 n. Let us maximize (0) over all 
sGN and real n^'s satisfying (JSJ). 

It is routine to see that for the optimal sequence we have 2 < < i G [si; moreover, 
both these inequalities can be simultaneously strict for at most one index i. 

Let t be the number of times we have 7ij_i = 2nj. The bound (0) reads 

/(n, Z) - 2 log 2 n < 2t + (s - t) (7) 

Given that 2 f (^-) s_ * _1 < n, the right hand side of (J7J) is maximized for t = O(logZ) and 
s = (1 + o(l)) j^j, implying the upper bound (J2J) by (J3J). 

Let us prove the lower bound. Let k = |_Z/2J . Define Go = -Z^u-i and G f = i^i,/_2- Let 
r G V(Go)) r o ^(^o) ^ e their roots. Define inductively on z the following graphs. Gi is 
obtained by taking k copies of and k — 1 copies of G^_ 1; pairwise vertex-disjoint, plus 
the root connected to the root of each copy of and G'^. We have cZ(rj) < Z — 1. 
The graph is defined in a similar way except that we take k — 1 copies of Gj_i and k 
copies of G' i _ 1 . Let i be the largest index such that m.ax(v(G i ),v(G' i )) < n. 

Let us disregard all roots, i.e., view Gj and G'j as usual (uncolored) graphs. Note that 
the trees Gi and G\ are non-isomorphic as for every j we can identify the level-j roots as 
the vertices at distance j + 1 from some leaf. 

Define gj — (k — l)j + I — 2, j G [0, i]. Let us show by induction on j that Duplicator 
can survive at least gj rounds in the (Gj, G^)-game. This is clearly true for j = 0. Let 
j > 1. If Spoiler claims one of rj,r'j then Duplicator selects the other. If Spoiler selects 
a vertex in a graph from the "previous" level, for example F C Gj with F = G'_ 1; then 
Duplicator chooses an F' C F' = G'_ x and keeps the isomorphism between F and F'. 
So any moves of Spoiler inside V(F) U V(F') will be useless and we can ignore F and F'. 
Thus it takes Spoiler at least k — 1 moves before we are down to the pair (G^-i, G'j-i), 
which proves the claim. 

Thus we have D(Gi) > D(G t , G<) > # = (| + o(l)) , finishing the proof. I 

Remark. Verbitsky |5] proposed a different argument to estimate D tree (n, I) which gives 
a weaker bounds than those in Theorem G] but can be applied to other classes of graphs 
with small separators. 
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Let us study D tree (n, I) for other I, n. The methods have much in common with the proof 
of Theorem ^ so our explanations are shorter. 

Theorem 5 Let an integer t > 1 be fixed. Suppose that l,n — > oo so that n > l l but 
n = o(/ m ). Then D tree (n, I) = (^p + o(l)) /. In fact, the lower bound can be achieved by 
uncolored trees. 

Proof. The lower bound is proved by the induction on t. If t — 1, take T\ = K\^2- 
One needs at least I — 1 moves to distinguish it from T[ = K\j-\. Let a = \l/2\ and 
b = [7/2] . Suppose we have already constructed T t _i and T[_ x , rooted trees with < 
vertices such that the root has degree at most I — 1. To construct T t take a copies of Tt-i 
and 6—1 copies of T' t _ x and connect them to the common root. For T[ we take a — 1 
and b copies respectively. The degree of the main root is a + b — 1 = 1 — las required. 
The order of T t is at most (a + b — 1)/* _1 + 1 < Also, Spoiler needs at least a moves 
before reducing the game to (T t _ 1 ,T/„ 1 ) (while, for t = 1, I moves are needed to finish 
the game), giving the required bound. 

Let us turn to the upper bound. Spoiler uses the same strategy as before. Namely, he 
chooses a median igT and of two possible multiplicities, summing up to I, chooses the 
smaller. Let mi + 1, m<i + 1, . . . , + 1 be the number of moves per each selected median. 
We have n > rii=i m *- Also, we have k < log 2 n because we always choose a median. 
Given these restrictions, the inequalities rrii < 1/2, i G [k — 1], and < I — 1, the sum 
^2i=i m i i s maximized if = 1 — 1 and as many as possible rrij = 1/2 are maximum 
possible. We thus factor out 1/2 at most t — 1 times until the remaining terms have the 
product (and so the sum) o(l). Thus, 

J2(mi + 1) < log 2 n + ^mi < /+ ^~ ^ +o(/), 

1=1 8=1 

completing the proof. I 

Theorems^andEldo not cover all the possibilities for n, /. The asymptotic computation in 
the remaining cases seems rather messy. However, the order of magnitude of Z? tree (n, I) 
is easy to compute with what we already have. Namely, Theorem El implies that for 
I = 9(n*) with fixed t G N we have D tree (n,l) = 0(1). Also, if I > 2 is constant, then 
D tree (n,l) = 0(lnn), where the lower bound follows from considering the order-n path 
and the upper bound is obtained by using the method of Theorem 
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4 The Giant Component 



Let c > 1 be a constant, p = £ , and G be the giant component of a random graph Q{n,p). 
Here we show the following result. 

Theorem 6 Let c > 1 be a constant, p = c/n, and G be the giant component of Q(n,p). 
Then whp 



This result allows us to conclude that for any p = 0(n x ) a random graph H e Q(n,p) 
satisfies whp 



The proof is an easy modification of that in [2] where the validity of (JH1) was established 
for p < (1.19... + o(l)) n^ 1 . The lower bound in Q comes from considering the graph H' 
obtained from H by adding an isolated vertex (and noting that whp H has (e _rap + o(l)) n 
isolated vertices). The method in |2] shows that the upper bound JHj) can fail only if 
D(G) > (e~ np + o(l)) n, where G is the giant component of H . (And p/n ~ 1.19... is the 
moment when v(G) ~ e~ np .) 

4.1 Upper Bound 

The structure of the giant component is often characterized using its core and kernel 
(e.g., see Janson, Luczak, and Rucihski jU Section 5]). We follow this approach in the 
proof of the upper bound in (JHJ). Thus, we first bound D(G) from above for a graph G 
with small diameter whose kernel fulfills some "sparsness" conditions. Then, we show 
that these conditions hold whp for the kernel of the giant component of a random graph. 

4.1.1 Bounding D(G) Using the Kernel of G 

The core C of a graph G is obtained by removing, consecutively and as long as possible, 
vertices of degree at most 1. If G is not a forest, then C is non-empty and 5(C) > 2. 

First we need an auxiliary lemma which is easily proved, similarly to the auxiliary lemmas 
in Section El by the path-halving argument. 




(8) 



D(H) = (e-" p + o(l))n. 



(9) 
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Lemma 7 Let G,G' be graphs. Suppose x G V(G) and x' G V(G') have been selected 
such that G contains some cycle P 3 x of length at most k while G' does not. Then 
Spoiler can win in at most log 2 k + 0(1) moves, playing all time inside G. I 

Lemma 8 Let G, G' be graphs and C, C' be their cores. If Duplicator does not preserve 
the core, then Spoiler can win in at most log 2 d+0(l) extra moves, where d is the diameter 
ofG. 

Proof. Assume that diam(G') = diam(G) for otherwise we are easily done. Suppose that, 
for example, some vertices x G C and x' G 0C' have been selected. 

If x lies on a cycle C\ C C, then we can find such a cycle of length at most 2d+ 1. Of 
course, G' cannot have a cycle containing x' , so Spoiler wins by Lemma [3 in log 2 (2<i + 
1) + 0(1) moves, as required. 

Suppose that x does not belong to a cycle. Then G contains two vertex-disjoint cycles 
C±, C2 connected by a path P containing x. Choose such a configuration which minimizes 
the length of P 3 x. Then the length of P is at most d. Spoiler selects the branching 
vertices yi G V{C\) fl V(P) and y 2 G V{C 2 ) H V(P). If some Duplicator's reply y[ is not 
on a cycle, we done again by Lemma [7| So assume there are cycles C[ 3 y\. In G we have 

dist(yi, y 2 ) = dist(yi, x) + dist(y 2 , x). (10) 

As x' G" C', any shortest x'^-path and x'y 2 -path enter x' via the same edge {x', z'}. But 
then 

dist(yl, y' 2 ) < dist(y[, z') + dist(y 2 , z') = dist(^, x') + dist(y 2 , ar') - 2. (11) 

By (fTUj) and (|TT|) . the distances between x,yi,y 2 cannot be all equal to the distances 
between x',y[,y 2 . Spoiler can demonstrate this in at most log 2 (dist(yi, y 2 )) + 0(1), as 
required. I 

In order to state our upper bound on D(G) we have to define a number of parameters 
of G. In outline, we try to show that any distict x,y G V(C) can be distinguished by 
Spoiler reasonably fast. This would mean that each vertex of C can be identified by a 
first order formula of small depth. Note that G can be decomposed into the core and 
a number of trees T x , x G V(C), rooted at vertices of C. Thus, by specifying which 
pairs of vertices of C are connected and describing each T x , x G V(C), we completely 
define G. However, we have one unpleasant difficulty that not all pairs of points of C 
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can be distinguished from one another. For example, we may have a pendant triangle on 
{x,y,z} with d(x) = d(y) = 2, in which case the vertices x and y are indistinguishable. 
However, we will show that whp we can distinguish any two vertices of degree 3 or more 
in C, which suffices for our purposes. 

Let us give all the details. For x G V(C), let T x C G denote the tree rooted at x, i.e., T x 
is a component containing x in the forest obtained from G by removing all edges of C. 
Let 



where D(T X ) is taken with respect to the class of graphs with one root. 

Let the kernel K of G be obtained from C by the serial reduction where we repeat as 
long as possible the following step: if C contains a vertex x of degree 2, then remove x 
from V(C) but add the edge {y, z) to E(C) where y, z are the two neighbors of x. Note 
that K may contain loops and multiple edges. We agree that each loop contributes 2 to 
the degree. Then we have S(K) > 3. 

Let u = A(G) and d be the diameter of G. It follows that each edge of K corresponds 
to the path P in C of length at most 2d. 

Let I be an integer such that every set of v < 61 vertices of K spans at most v edges in 
K. (Roughly speaking, we do not have two short cycles close together.) 

For {x,y} G E(K) let A XjV be the set of vertices obtained by doing breadth first search 
in K — x starting with y until the process dies or, after we have added a whole level, we 
reach at least k = 2 l ~ 2 vertices. Let K XiV = K[A X:V U {x}]. 

The height of z G V(K x>y ) is the distance in K — x between z and y. It is easy to deduce 
from the condition on short cycles that each K x>y C K — x has at most one cycle and the 
maximum height is at most I. In fact, the process dies only in the case if y is an isolated 
loop in K — x. For xy G E(K) let G X)V be a subgraph of G corresponding to K xy . We 
view K xy and G x>y as having two special roots x and y. 

Here is another assumption about G and / we make. Suppose that for any xx', yy' G E{K) 
if K XjX i and K y<y i have both order at least k and A XjX i fl A y ^i = 0, then the rooted graphs 
G X)X and G y<y i are not isomorphic. Let 



t = max{D(T x ) : x G V(C)}, 



l(\nu + In Inn + /) 



+ 2u + log 2 d, 



(12) 
(13) 



b = 



In/ 

b + t + u + 2 log 2 d. 
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Lemma 9 Under the above assumptions on G, we have D{G) < b + 0(1). 

Proof. Let G' ^ G. Let C, K' be its core and kernel. We can assume that A(G') = u 
and its diameter is d for otherwise Spoiler easily wins in u + 2 or log 2 d + 0(1) moves. 

By Lemma |H1 it is enough to show that Spoiler can win the Ehrenfeucht (G, G')-game in 
at most b — log 2 d + 0(1) moves provided Duplicator always respects C and K. Call this 
game C. 

Color V(K) U E(K) and V(C) by the isomorphism type of the subgraphs of G which 
sit on a vertex/edge. We have a slight problem with the edges of K as the color of an 
unordered edge may depend in which direction we traverse it. So, more precisely, every 
edge of K is considered as a pair of ordered edges each getting its own color. Do the 
same in G' . As G ^ G', the obtained colored digraphs K and K' cannot be isomorphic. 
Call the corresponding digraph game JC. 

Claim 1 If Spoiler can win the game K, in m moves, then he can win C in at most 
m + t + u + log 2 d + 0(1) moves. 

Proof of Claim. We can assume that each edge of K' corresponds to a path in G' of 
length at most 2d+ 1: otherwise Spoiler selects a vertex of C' at the O'-distance at least 
d + 1 from any vertex of K' and wins in log 2 d + 0(1) moves. 

Spoiler plays according to his /C-strategy by making moves inside V(K) C V(G) or 
V(K') C V(G'). Duplicator's reply are inside V(K'), so they correspond to replies in the 
/C-game. In at most m moves, Spoiler can achieve that the set of colored edges between 
some selected vertices x,y G K and x', y' G K' are different. (Or loops if x = y.) 

In at most u + 1 moves, Spoiler can either win or select a vertex z inside a colored xy-p&th 
P (an edge of K) such that z' either is not inside an x'y'-path (an edge of K') or its path 
P' 3 z' has a different coloring from P. In the former case, Spoiler wins by Lemma El in 
G there is an xy-p&th containing z and no vertex from K. 

Consider the latter case. Assume that \P\ = \P'\, for otherwise we are done by Lemma El 
Spoiler selects w G P such that for the vertex w' G P' with distp(w,x) = dist pi (w',x') 
we have T w ^ T' w ,. If Duplicator does not reply with it/, then she has violated distances. 
Otherwise Spoiler needs at most t extra moves to win the game T on (T W ,T^,) (and at 
most log 2 d + 0(1) extra moves to catch Duplicator if she does not respect T). I 

It remains to bound D(K), the colored digraph version. This requires a few preliminary 
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results. 

Claim 2 For any {x, x'} G K we have D(K XjX ) < bo + 0(1) in the class of colored 
digraphs with two roots, where bo is defined by ()12|) . 

Proof of Claim. Let T = ^ and T" ^ T. If T is a tree, then we just apply a version of 
Theorem ^ using the order (<u2 l ) and maximum degree (<«)• Otherwise, Spoiler first 
selects a vertex zeT which lies on the (unique) cycle. We have at most u — 1 components 
in T — z, viewing each as a colored tree where one extra color marks the neighbors of z. 
As T ^ T', in at most u + 1 moves we can restrict our game to one of the components. 
(If Duplicator does not respect components, she loses in at most log 2 d + 0(1) moves.) 
Now, one of the graphs is a colored tree, and Theorem ^ applies. I 

Claim 3 For every two distinct vertices x, y G V(K) there is a first order formula 
*&x,y{z) with one free variable and quantifier rank at most bo + log 2 d + 0(1) such that 
G \= <& Xt y(x) and G \£ $ x>y (y). (Note that we have to find $ Xj2/ for x, y in the kernel only, 
but we evaluate & XjV with respect to G.) 

Proof of Claim. To prove the existence of $> Xj y we have to describe Spoiler's strategy, 
where he has to distinguish (G, x) and (G, y) for given distinct x,y G K. 

If the multiset of isomorphism classes K XiX r, over {x, x'} G E(K) is not equal to the 
multiset {K y>y / : {y,y'} G E(K)}, then we are done by Claim 2. So let us assume that 
these multisets are equal. 

Note that an isomorphism K X)X i = K y ^i implies an isomorphism G x ^ x > = G y>y '. Also, by 
our assumption on /, the isomorphism G X)X * = G y ^ y > implies that V(K XjX i) r\V(K y<y i) ^ 0. 

At most one neighbor of x can be an isolated loop for otherwise, we get 3 vertices spanning 
4 edges. The same holds for y. As the height of any K a ^ is at most /, we conclude that 
distx(£, y) < 21. A moment's thought reveals that there must be a cycle of length at 
most 4Z containing both x and y. But this cycle rules out the possibility of a loop adjacent 
to x or to y. Thus, in order to exclude 2 short cycles in K close to each other, it must 
be the case that dist(x, y) < I — 1 and dx(x) = dx(y) = 3. Moreover, let xi,X2,xs and 
2/i ? 2/2 , 2/3 be the neighbors of x and y such that G XjXi = G y ^ Vi \ then (up to a relabeling 
of indices), we have the following paths between x and y: either (x, x±, . . . , yi, y) and 
(x, x 2 , . . . , 2/3, y) or (x, x u . . . , y 3 , y) and (x, x 2 ,...,yx,y) 

Now, K XjX3 is not isomorphic to K XiXl nor to K XiX2 by the vertex-disjointness. (Note that 
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it is not excluded that K x>Xl = K XX2 : they may intersect, for example, in y.) 

But then z = x is different from z = y in the following respect: the (unique) short cycle 
of K containing z has its two edges entering z from subgraphs isomorphic to K XXl and 
Kx,x 2 (while for z = y the corresponding subgraphs are isomorphic to K X)Xl and K XjX3 ). 

This can be used by Spoiler as follows. Spoiler selects £i,x 2 . If Duplicator replies with 
y 3 , then Spoiler can use Claims 2 and 3 because K ym is not-isomorphic to K x>Xl nor to 
Kx,x 2 - Otherwise, the edge {x, x 2 } is on a short cycle while {y,y2} is not. Spoiler uses 
Lemma I 

By Lemma El we can find $^(x), a formula of rank at most log 2 d + 0(1) which, with 
respect to G, evaluates to 1 for all x G V(K) and to otherwise. More precisely, Lemma|Hl 
gives a formula $c(^) testing for x G V(C). But V(K) C V(C) are precisely the vertices 
of degree at least 3 in C. 

Now, as it is easy to see, for any x G K the formula 

$ x {v) := $ K {v) A /\ $ x , y {v) (14) 

yeV(K)\{x} 

identifies uniquely x and has rank at most log 2 d + b + 0(1). 

Take x G V(K). If there is no x' G V(K') such that G' \= $ x (x'), then Spoiler selects 
x. Whatever Duplicator's reply x' is, it evaluates differently from i on $ r Spoiler can 
now win in at most D(Q X ) moves, as required. If there are two distinct y',z' G K' such 
that G' |= <& x (y') and G' |= §> x (z'), then Spoiler selects both y' and z' . At least one of 
Duplicator's replies is not equal to x, say, y x. Again, the selected vertices y G V(K) 
and y' G V(K') are distinguished by $> x , so Spoiler can win in at most extra D(<& x ) 
moves. 

Therefore, let us assume that for every x G V(K) there is the unique vertex x' = <p(x) G 
V(K') such that G' \= $> x (x'). Clearly, <fr is injective. Furthermore, <ft is surjective 
for if x' G" (j)(V(K)), then Spoiler wins by selecting x' G V(K') and then using & x , 
where x G V(K) is Duplicator's reply. Moreover, we can assume that Duplicator always 
respects for otherwise Spoiler wins in at most log 2 d + bo + 0(1) extra moves. 

As K ^ K', Spoiler can select x, y G V(K) such that the multisets of colored paths (or 
loops if x = y) between x and y and between x' = 4>(x) and y' = 4>(y) are distinct. Again, 
this means that some colored path has different multiplicities and Spoiler can highlight 
this in at most u + 1 moves. Then in at most log 2 I + 0(1) moves he can ensure that 
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some vertices z G V(K) and z' G V(K') are selected such that the removed trees T z and 
T z i rooted at z and z' are not isomorphic, compare with Lemma El 

Now, by the definition of t, at most t moves are enough to distinguish T z from T' z , (plus 
possible log 2 d + 0(1) moves to catch Duplicator if she replies outside V(T Z ) U V(T Z /)). 

This completes the proof of Lemma I 
4.1.2 Probabilistic Part 

Here we estimate the parameters from the previous section. As before, let G be the giant 
component of Q(n, £ ), let C be its core, etc. 

It is well-known that whp u = O( pp^ ) and d = 0(\nn). 

Lemma 10 Whp every edge of K corresponds to at mostO(lnn) vertices ofG. Similarly, 
for any x G V(C) we have v(T x ) = O(lnn). 

Proof. The expected number of .fT-edges, each corresponding to precisely % = 0(\nn) 
vertices in G is at most 

But ec < e c for c > 1, so if % is large enough, i > Minn, then the expectation is o(n~ 3 ). 
Similarly, the expected number of vertices x with v(T x ) = i = O(lnn) is at most 

I 

In particular, our results from Section El imply that whp t = 0( . ll ? n ). 

Let, for example, / = 2 In Inn. Thus k/lnn — > oo, where A; = 2^ 2 . It remains to prove 
that this choice of / satisfies all the assumptions. 

Lemma 11 Whp any set of s < 61 vertices of K spans at most s edges. 

Proof. A moment's thought reveals that it is enough to consider sets spanning connected 
subgraphs only. 
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Let L = Minn be given by Lemma ITU1 The probability that there is a set S such that 
\S\ = s < 61 and K[S] is a connected graph with at least s + 1 edges is at most 



+ e ( n ) ( i) 2 e n 00 & + 2) - P y^- 2) 

s=4 W W <^,...,4+i<£ <=i V i7 



=4 0<<i,.../ s+ i<L i=l v 7 s=4 

The lemma is proved. I 



Lemma 12 W-Tip if does not contain four vertices x,x',y,y' such that xx',yy' € E(K), 
v{K x>x t) > k, A x>x , n = 0, and G x ^ = G y>y >. 



Proof. Given c, choose the following constants in this order: small €\ > 0, large Mi, large 
M 2 , small e 2 > 0, and large M 3 . 

Consider breadth-first search in G — x starting with x 1 . Let L\ = {x 1 }, L 2 , L 3 , etc., be 
the levels. Let Tj = {x} U (U*- =1 Lj). Let s be the smallest index such that \T S \ > M 2 lnn. 

Chernoff's bound implies that the probability of \T S \ > 2cM 2 lnn is o(n~ 2 ). Indeed, this 
is at most the probability that the binomial random variable with parameters (n, - x 
M 2 In n) exceeds 2cM 2 In n. 

Similarly, with probability 1 — o(n~ 3 ) we have \L i+ i\ = (c ± e 2 )|Lj| provided i > s and 
\Ti\ = o(n). Hence, we see that from the first time we reach 2M 2 lnn vertices, the levels 
increase proportionally with the coefficient close to c for further 0(lnn) steps. 

Take some i with |Tj| = O(lnn). The sizes of the first 0(lnn) levels of the breadth-first 
search from the vertices of Li can be bounded from below by independent branching 
processes with the number of children having the Poisson distribution with mean c — e 2 . 
Indeed, for every active vertex v choose a pool P of [(1 — a )n] available vertices and let 
v choose its neighbors from P, each with probability c/n. (The edges between v and 0P 
are ignored.) If v claimed r neighbors, then, when we take the next active vertex u, we 
add extra r vertices to the pool, so that its size remains constant. 

With positive probability pi the ideal branching process survives infinitely long; in fact, 
Pi is the positive root of 1 — p\ — e~ cpi . Let 

c^e — c 

p 2 = max — ■ — < 1. 

j>o j\ 
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The numbers pi > and P2 < 1 are constants (depending on c only). 

Take the smallest i such that |Tj| > 2cM 3 ln?T,. The breadth-first search inside G goes 
on for at least M\ further rounds (after the i-th round) before we reach a vertex outside 
G XjX i. We know that > (— — ei) |Tj| because the levels grow proportionally from the 
s-th level. Let Z consist of the vertices of Li for which the search process in G — x goes 
on for at least Mi further levels before dying out. By Chernoff's bound, with probability 
1 - o{n- 2 ) we have \Z\ > f 

Let us fix any K x x i having all the above properties and compute the expected number of 
copies of K XtX / in G. More precisely, we compute the expected number of subgraphs of G 
isomorphic to G\Tj\ such that a specified |Z|-subset of the last level has specified trees, 
each of height at least Mi, sitting on it. The expected number of G[Tj]-subgraphs is at 
most n'^'pf 1 \ This has to be multiplied by 

(p 2 + o(l)) M ^<p^ {c - 1)pilT ' l/4c : 

because if we want to get a given height-Mi tree, then at least t times we have to match 
the sum of degrees of a level, each coincidence having probability at most p 2 + o(l). As 
the constant Mi can be arbitrarily large, we can make the total expectation o{n~ 2 ). 

Markov's inequality implies the lemma. I 

Finally, putting all together we deduce the upper bound of Theorem |H1 



4.2 Lower Bound 

Let / = (1 — e) r^p- for some e > 0. We claim that whp the core C has a vertex i adjacent 
to at least / leaves of G. (Then we have D(C) > I + 1: consider the graph obtained from 
C by adding an extra leaf to i.) 

Let us first prove this claim for the whole random graph H e Q{n, c/n) (rather than for 
the giant component G C H). For i G [n] let Xi be the event that the vertex i is incident 
to at least / leaves. It is easy to estimate the expectation of X = X^=i 

= (1 + 0(1))— ► 00. 
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Also, for i ^ j, 



e( Xi ax 3 ) = 2 )( n \ 2 y\i- P f^-^ 

= (l + o(l)) 

The second moment method gives that X is concentrated around its mean. 

Now, let us reveal the vertex set A of the 2-core of the whole graph H . When we expose 
the stars demonstrating Xi — 1 one by one, then for each % the probability of i G A is 
— + o(l). The sharper results of Luczak [3] imply that whp the core C of the giant 
component has size 0(n). Hence, whp at least one vertex i with X, = 1 belongs to the 
V(C), giving the required. 



5 Random Trees 

We consider the probabilistic model T(n), where a tree T on the vertex set [n] is selected 
uniformly at random among all n n ~ 2 trees. In this section we prove that whp D(T) is 
close to the maximum degree of T. 

Theorem 13 Let T e T(n). Whp D(T) = (1 + o(l))A(T) = (1 + o(l))^. 

Let T{n, k) be a forest chosen uniformly at random from the family of T n j~ of all forests 
with the vertex set [n], which consist of k trees rooted at vertices 1,2, ... ,k. Note that 
a random tree T e T(n) can be identified with Tin, 1). We recall that \T n ,k\ = kn n ~ k ~ 1 , 
see e.g. Stanley [8,, Theorem 5.3.2]. We start with the following simple facts on T{n, k). 

Lemma 14 Let k = k(n) < ln 4 n. 

(i) The expected number of vertices in all trees of T[n, k) , except for the largest one, 
is 0(k^/n). 

(ii) The probability that T{n, k) contains precisely I, £ = 0, . . . ,k — 1, isolated vertices 
is given by (1 + 0{k 2 /n)) (Y)^ 1 ~ e -1 )*-' -1 . 

(Hi) The probability that the roots of J r (n,k) have more than k(l + 1/lnn) + 21n 2 n 
neighbors combined is o(n -3 ). 



18 



(iv) The probability that t given roots of J-{n, k) have degree at least s > 4 each is 
bounded from above by (2/(s — 1)!)^ 

Proof. Ifi<n/2 + l, then the probability that a tree rooted at a vertex j = 1,2, ... ,k 
in the forest T{n, k) has precisely i vertices is given by 

'n - k\ .j_ 2 (k — l)(n — i) n - i ~ k 



o(r 3 / 2 ) . 



i — 1 / kn n k 1 

Consequently, the expectation of the sum of the orders of all components of .F(n, k) with 
at most n/2 + 1 vertices is 0{k^/n). 

In order to see (ii) note that from the generalized inclusion-exclusion principle the stated 
probability equals 

V ( l \ r-i ( k \ ^-^-'T'"' 1 
-('^(^S ag-fe-Vo. ^ <15) 

=( i+o (^))h 1 ) e " <(i - e " T "'" 1 ' 



For the probability that precisely m (> /c) vertices of T{n, k) are adjacent to the roots, 
Stirling's formula gives 

fn - k\ km m{n - ky-*-™- 1 < / | Q /gu/ e^H \"» 

V 771 y k Tl n k ^ \ \ 77, / / \ / 



For every x, < x < 1, we have xe 1 x < e ^ x ) 2 / 2 ; so the above formula is bounded 
from above by exp(— ^""^ )■ Since 

£ -(-^ -<»-*>■ 

m>fc(l+l/lnra)+21n 2 ra 

the assertion follows. 

For A; = 1 the probability that a given root has degree at least s is bounded from above 
by 

'n - 1\ t{n - i)"-*-2 i 2 

f J " ^ (f = l)! " (7=1)! ■ 

If we fix some i > 2 roots, then if we condition on the vertex sets of the £ corresponding 
components, the obtained trees are independent and uniformly distributed, implying the 
required bound by the above calculation. | 
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Using the above result one can estimate the number of vertices of T e T{n) with a 
prescribed number of pendant neighbors. 

Lemma 15 Let X^ m denote the number of vertices in T e T(n) with precisely £ neigh- 
bors of degree one and m neighbors of degree larger than one. Let 

A C {(£,m): < £ < Inn, l<m<lnn}, 

be a set of pairs of natural numbers and X A = YltemyeA-^^- Then, the expectation 

E(X A ) = (l + o(l))n ,L (17) 

(£,m)GA ' V 

and E(X A (X A - 1)) = (1 + o(l)) (E(X A )) 2 . 



Proof. Using Lemma ITffi ii) we get 

'n- 1\ fm + e- 1\ _ lf , _i, m _i (m + t){n - l) n ~ m ~ e - 2 



(£,m)eA 



which gives ()17|) . In order to count the expected number of pairs of vertices with pre- 
scribed neighborhoods one needs first to choose £ + m neighbors of a vertex and then 
compute the expectation of the number of vertices of a given neighborhood in the random 
forest J-(n, £ + m) obtained in this way. However, the largest tree of J-{n, £ + m) has the 
expectation n — 0{y/nhin) ( Lemma IT^j) ; one can easily observe that this fact implies 
that the expected number of vertices with a prescribed neighborhood in T{n, £ + m) is 
(1 + o(l)) E(X A ), and so E(X A (X A - 1)) = (1 + o(l)) (E(X A )) 2 . I 

As an easy corollary of the above result we get a lower bound for D{T{n)). 
Theorem 16 Let T e T(n). Whp D(T) > (1 - o(l))A(T) = (1 - o(l)) 

Proof. Since whp the maximum degree is (1 — o(l))lnn/lnlnn, in order to prove the 
assertion it is enough to show that whp T contains a vertex t; with 

Inn ,^ „ s 



4 = (i-o(i)) 



In In n 

neighbors of degree one; indeed, to characterize such a structure Spoiler needs at least 
£q + 1 moves. Using Lemma ITB1 we infer that the for the number of vertices X^ of T with 
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exactly I neighbors of degree 1 we have E{Xg) = 0(e e n/£\). Thus, one can choose £q so 
that (dH|) holds and E(X io ) -> oo. Then, due to Lemma ESJ VarpQj = o((£(X A) )) 2 ), 
and Chebyshev's inequality implies that whp X eo > 0. I 

Let us state another simple consequence of Lemma ITU which will be used in our proof of 
Theorem 1131 Here and below N r (v) denotes the r-neighborhood of v, i.e., the set of all 
vertices of a graph which are at the distance r from v, and N< r (v) = Ui=o-^( r )- 

Lemma 17 Let r = r (n) = [7 Inn] . Then, whp the following holds for every vertex v 
ofTeT{n): 

(i) \N< ro (v)\ < 10 8 ln 4 n, 

(ii) N< ro (v) contains fewer than lnn/(lnlnn) 2 vertices of degree larger than (lnlnn) 5 . 

Proof. For s < r let W s = U* =0 Ni(v). Note that, conditioned on the structure of the 
subtree of T induced by W s for some s < r , the forest T — W s -\ can be identified 
with the random forest on n — \W s -\\ vertices, rooted at the set W s . Thus, it follows 
from Lemma fT^T iii^) that once for some i we have |A^(v)| > 41n 3 n then |A^ +1 (w)| < 
\Ni(v)\{l + 2/ Inn), so that 

\N< ro (v)\ < 4r ln 3 n(l + 2/lnn) r ° < 10 8 ln 4 ra . 

In order to show (ii) note that (i) and Lemma fT4T iv) imply that the probability that, for 
some vertex v, at least I = |lnn/(lnlnn) 2 J vertices of N< ro (v) have degree larger than 
m = (lnlnn) 5 is bounded from above by 

/ln 5 n\ / 2 V ( 2eln 5 n \ l mf 

I 

In our further argument we need some more definitions. Let T be a tree and let v be 
a vertex of T. For a vertex w G N r (v) let P vw denote the unique path connecting v to 
w (of length r). Let the check Ch(v; P vw ) be the binary sequence bo ■ ■ -b r , in which, for 
i = 0, . . . , r, bi is zero (resp. 1) if the z-th vertex of P vw is adjacent (resp. not adjacent) 
to a vertex of degree one. Finally, the r -checkbook Ch r (v) is the set 

Ch r (t>) = {Ch(t>; P vw ) : w & N r and P vw is a path of length r}. 
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Note that a checkbook is not a multiset, i.e., a check from Ch r (v) may correspond to 
more than one paths P vw . 

Our proof of the upper bound for D(T(n)) is based on the following fact. 

Theorem 18 Let tq = [7 Inn]. Whp for each pair P vw , P v > w > of paths of length ro in 
T G T{n) which share at most one vertex, the checks Ch(v; P vw ) and Ch(v; P v ' w ') are 
different. 

Proof. Let C = del(T) denote the tree obtained from T by removing all vertices of degree 
one. From Lemma ITB1 it follows that whp the tree C has (1 — e" 1 — o(l))n vertices of 
which ^ ^ 

(1 + o(l)) n Q -^r = ( ex P( e ~ X " !) - e ~' + o(l)) n 

vertices have degree one and 

an = (1 — exp(e~* — 1) + o(l)) n. 

vertices have degree greater than one. 

Moreover, among the set B of (e -1 + o(l))n vertices removed from T, 

OO _ £_ I 

(1 + (l))n = ^ + ex P( e_1 - 2 > 

1=0 

were adjacent to vertices which became pendant in C. Let B' denote the set of the 
remaining 

(e -1 - exp(e^ 1 - 2) + o(l))n = (p + o{l))n 

vertices which are adjacent to vertices of degree at least two in C. Note that, given 
C = del(T), each attachment of vertices from B \ B' to pendant vertices of C such that 
each pendant vertex of C get at least one vertex from B\B', as well as each attachment 
of vertices from B' to vertices of degree at least two from C is equally likely. 

Let P vw , P v > w >, be two paths of length ro in T which share at most one vertex. Clearly, 
each vertex of P vw , except, maybe, at most two vertices at each of the ends, belong to 
C and have in it at least two neighbors; the same is true for P v / W >. Since (po + o(l))n 
vertices from B' are attached to the an vertices of degree at least two in C at random, 
the probability that one such vertex gets no attachment is 

po = (1 + o(l)) 1 = (1 + o(l)) e~ po/a = 0.692... + o(l). 

\ an J 
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Therefore, the probability that the checks Ch(v,P vw ) and Ch(w, P v > w r) are identical is 
bounded from above by 

(pl + (l-p ) 2 + o(l)) ro <e^ = o(n-*). 

Since by Lemma lTTlT j whp T contains at most 0(n In 4 n) checks of length r , the assertion 
follows. I 

Now, let ro = [7 In n\ . We call a tree T on n vertices typical if: 

• for each pair of paths P vw , P v ' w ' of length r which share at most one vertex, the 
checks Ch(t>; P vw ), Ch(v; P v > w >) are different, 

• for the maximum degree A of T we have 

In n 2 In n 

< A < , 

2 In In n In In n 

• \N< ro \ < 10 8 ln 4 n, for every vertex v, 

• for every vertex v at most lnn/(ln In n) 2 vertices of degree larger than (In Inn) 5 lie 
within distance ro from v. 

Theorem 19 For a typical tree T E T{n) we have D{T) < (1 + o(l)) A. 

Proof. Let T be a typical tree and T' be any other graph which is not isomorphic to T. 
We shall show that then Spoiler can win the Ehrenfeucht game on T and T' in (l + o(l))A 
moves. 

Let us call a vertex v of a graph a yuppie, if there are two paths P vw , P vw i of length r 
starting at v so that V(P VW ) fl V(P VW /) = {v}. Note that the set of all yuppies Y spans 
a subtree in T, call it K. 

Our approach is similar to that for the giant component from Section 0] 

Let us view if as a colored graph where the color of a vertex x is the isomorphism type 
of the component of T — (Y \ {x}) rooted at x. Let Y' be the set of yuppies of T', and 
let K' = T'[Y']. We can assume that Duplicator preserves the subgraphs K and K', for 
otherwise Spoiler wins in extra O (In Inn) moves. 

Claim 1 Any distinct v,v' G K can be distinguished (with respect to G) in O(lnlnn) 
moves. 
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Proof of Claim. Assume that the ro-checkbooks of v, v' are the same for otherwise Spoiler 
wins in log 2 (ro) +0(1) moves. (Please note that the checkbooks are viewed as sets, not 
as multisets, so the number of moves does not depend on the degrees of v and v'.) 

Take a path P vx of length ro, which shares with P m < only vertex v. Spoiler selects x. Let 
Duplicator reply with x'. Assume that Ch(w,P vx ) = Ch(v, P v > x '). The path P v r x > must 
intersect P vx ; thus v G P v > x >. Next, Spoiler selects the P ra -neighbor y of v; Duplicator's 
reply must be y' G P v 'x'- 

Let z G T maximize dist(t> , z) on the condition that Ch(z) = Ch(t> ) and v lies between y 
and z in T. Define the analogous vertex z', replacing v,y in the definition by v',y'. We 
have dist(t>,z) > dist(t>', z'). Let Spoiler select w = z. If Duplicator's reply w' satisfies 
Ch(iy') ^ Ch(w), then Spoiler quickly wins. Otherwise, dist(t>,w) > dist(t>', w'). More- 
over, dist(f,w) < 2r (because their r -checkbooks are non-empty and equal). Spoiler 
wins in log 2 r + 0(1) extra moves. The claim has been proved. I 

Similarly to the argument surrounding ()14j) . one can agrue that for every vertex x G K 
there is a formula $> x (v) of rank O (In Inn) identifying x (with respect to T). Moreover, 
we can assume that this gives us an isomorphism <p : K — > K' which is respected by 
Duplicator. 

As T ^ T", there are two cases to consider. 

Case 1 There is x G K such that T x ^ T' xl) where x' = <fi{x) and T' x , is the component 
of V - (Y' \ {x'}) rooted at x' . 

Since each vertex of T is within distance at most r$ from some yuppie, the tree T x has 
height at most tq. If T' x , has a path of length greater than 2ro or a cycle, then Spoiler 
easily wins, so assume that T' is a tree. Now Spoiler should select all vertices of T x which 
are of degree larger than (In Inn) 5 , say Wi, . . . , w t . Since T is typical there are at most 
lnn/(lnlnn) 2 such vertices in T v . Suppose that, in responce to that, Duplicator chooses 
vertices w^, . . . , w' s in T' x ,. Then, T v \ {wi, . . . , w s } splits into a number of trees F±, . . . , F u , 
colored accordingly to their adjacencies to the u^'s. Now, for some i the multisets of 
colored trees adjacent to Wi and w[ are different. Spoiler can highlight this by using at 
most A(T) + 1 moves. Now Spoiler plays inside some Fi the strategy of Theorem ^ Note 
that Fi has diameter at most 2ro and maximum degree at most (In Inn) 5 . 

Case 2 T' is not connected. 
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As K' = K is connected, there is a component C of T' without a yuppie. Spoiler chooses 
an x' G C. Now, any Duplicator's reply x is within distance r$ from a yuppie, which is 
not true for x' . Spoiler can win in O(lnlnn) moves. 

Consequently, for a typical tree T, 

In 11 

D(T) < A(T) + — - — + 0((lnlnn) 6 ) , 
(in Inn)" 1 

and the assertion follows. I 

Proof of TheoremMiA Theorem IT31 is an immediate consequence of Theorems ITol and 
and the fact that, due to Lemmas ITU and IT7I whp a random tree T 6 T(n) is typical. I 



6 Restricting Alternations 

If Spoiler can win the Ehrenfeucht game, alternating between the graphs G and G' at 
most r times, then the corresponding sentence has the alternation number at most r, 
that is, any chain of nested quantifiers has at most r changes between 3 and V. (To make 
this well-defined, we assume that no quantifier is within the range of a negation sign.) 
Let D r (G) be the smallest depth of a sentence which defines G and has the alternation 
number at most r. It is not hard to see that D r (G) = ma.x{D r (G,G') : G' ^ G}, where 
D r (G, G') may be defined as the smallest k such that Spoiler can win EHRfc(G, G') with 
at most r alternations. For small r, this is a considerable restriction on the structure 
of the corresponding formulas, so let us investigate the alternation number given by our 
strategies. 

Let _D* ree (n, I) be the maximum of D r (T) over all colored trees of order at most n and 
maximum degree at most I. 

Unfortunately, in Theorem we have hardly any control on the number of alternations. 
However, we can show that alternation number suffices if we are happy to increase the 
upper bound by a factor of 2. 

Lemma 20 Let T and T' be colored trees. Suppose that T ^ T', where = stands for 
the isomorphism relation for colored trees, i.e., the underlying (uncolored) trees ofT and 
T' may be isomorphic. Furthermore, assume that v(T) > v(T') and denote n = v(T). 
Assume also that A(T) < I and let both I and Inn/ In/ tend to the infinity. Then Spoiler 
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can win the Ehrenfeucht game on (T, T') in at most 




(19) 



moves playing all time in T . 

Proof. In the first move Spoiler selects a median x G T; let x' be Duplicator's reply. 
If d(x) > d(x'), then Spoiler wins in extra / moves, which is negligible when compared 
to (fTHjl . So, suppose that d(x') > d(x). 

Let t = d(x) and C±, . . . , Ct be the (rooted) components of T — x indexed so that v{Ci) > 
v{C2) > ■ ■ ■ > v(Ct). Referring to the root of a component we mean the vertex of it which 
is adjacent to x. Spoiler starts selecting, one by one, the roots of G\, C 2 , . . .. Duplicator 
is enforced to respond with roots of distinct components of T' — x'. Spoiler keeps doing 
so until the following situation occurs: he selects the root y of a component C = C[ while 
Duplicator selects the root y' of a component C such that v(C) > v(C) and C ^ C (as 
rooted trees). Such a situation really must occur for some i < t due to the conditions 
that v(T) > u(T'), d(x) < d(x'), and T ^ V. 

We claim that if Spoiler selects a vertex z inside C, then Duplicator must reply with 
some z' G C for otherwise Spoiler wins in at most log 2 n moves. Indeed, suppose z' ^ C . 
Spoiler selects Z\ which is a middle point of the yz-p&th. Whatever the reply z[ is, the 
z'zj-path or z[y'-path contains the vertex x'. Suppose it is the z'-Zi-path. Then Spoiler 
halves the ^zi-path. In at most log 2 n times he wins. 

Thus making i + 1 < t+1 < l + l steps, we have reduced the game to two non-isomorphic 
(rooted) trees, C and C, with v(C) < min(4, |) v(T). In the game on (C,C) Spoiler 
applies the same strategy recursively. Two ending conditions are possible: the root of 
C has strictly larger degree than the root of C and Duplicator violates a color, the 
adjacency, or the equality relation. It is easy to argue, cf. the proof of Theorem Q that 
the worst case for us is when we have i = (1 + o(l)) I all the time, which gives the required 
bound (HH. I 

Theorem 21 Let both I and Inn/ In/ tend to the infinity. Then 



Proof. Let T be a tree of order n and maximum degree at most I and let G ^= T. If 
A(T) 7^ A(Cr) then Spoiler wins the Ehrenfeucht game on (T, G) in at most I + 2 moves 



D*»{n,l)< (l + o(l)) 



I Inn 



(20) 



In/ ' 
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playing in the graph of the larger degree. We will therefore assume that T and G have 
the same maximum degree not exceeding 

Case 1 G contains a cycle of length no more than n + 1. 

Spoiler plays in G proceeding as in the last paragraph of the proof of Lemma ["fl 
Case 2 G is connected and has no cycle of length up to n + 1. 

If v(G) < n, then G must be a tree. Lemma 12*0 applies. Let us assume v(G) > n. Let A 
be a set of n + 1 vertices spanning a connected subgraph in G. This subgraph must be 
a tree. Spoiler plays in G staying all time within A. Lemma "2TFI applies. 

Case 3 G is disconnected and has no cycle of length up to n + 1. 

We can assume that every component H of G is a tree for otherwise Spoiler plays the 
game on (T, H) staying in H, using the strategy described above. 

Suppose first that G has a tree component H such that H ^ T and f(-ff) > n. If 
f (if) = n, let T 1 = H. Otherwise let T' be a subtree of H on n + 1 vertices. Spoiler 
plays the game on (T, T") staying in T' and applying the strategy of Lemma "20] (with T 
and T" interchanged and perhaps with n + 1 in place of n). 

Suppose next that all components of G are trees of order less than n. In the first move 
Spoiler selects a median x of T. Let Duplicator respond with a vertex x' in a component 
T' of G. If in the sequel Duplicator makes a move outside T", then Spoiler wins by 
Lemma U3 As long as Duplicator stays in T", Spoiler follows the strategy of Lemma 13201 

Finally, it remains to consider the case that G has a component T' isomorphic to T. 
Spoiler plays in G. In the first move he selects a vertex x' outside T'. Let x denote 
Duplicator's response in T. Starting from the second move Spoiler plays the game on 
(T,T') according to Lemma "2TJI where x is considered colored in a color absent in T' . 

Our description of Spoiler's strategy is complete. I 

It is not clear what the asymptotics of D^ ee (n,l) is. We could not even rule out the 
possibility that D^in, I) = (± + o(l)) . 

The similar method shows that Dq qc (ti, I) = O(lnn) if I > 2 is constant and D^ ee (n, I) = 
©(/) if = 0(1) but the exact asymptotics seems difficult to compute. 

Using these results, one can show that the upper bounds in Theorems [1J21 and El apply to 
Di(G), that is, there are strategies for Spoiler requiring at most one alternation. It is not 
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clear whether alternations is possible here. One of a few places that seem to require an 
alternation is establishing that is a bijection: Spoiler may be forced to start in one of 
the graphs, while later (for example, when showing that T x ^ T' x ,) he may need to swap 
graphs. 
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